Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Outside the cave of shadows: using syntactic annotation to enhance authorship attribution

Identifieur interne : 002766 ( Main/Exploration ); précédent : 002765; suivant : 002767

Outside the cave of shadows: using syntactic annotation to enhance authorship attribution

Auteurs : H. Baayen [Pays-Bas] ; H. Van Halteren [Pays-Bas] ; F. Tweedie [Royaume-Uni]

Source :

RBID : ISTEX:5F5DCCD19520C10BAEF5530E87D9BDF498D3E4AE

Abstract

This paper reports an experiment in authorship attribution in which statistical measures and methods that have been widely applied to words and their frequencies of use are applied to rewrite rules as they appear in a syntactically annotated corpus. The outcome of this experiment suggests that the frequencies with which syntactic rewrite rules are put to use provide a better clue to authorship than word usage. Complementary methods focusing on the high-frequency head and the low-frequency tail of the distribution independently reveal a higher resolution than traditional word-based analyses, and promise enhanced accuracy for authorship attribution.

Url:
DOI: 10.1093/llc/11.3.121


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Outside the cave of shadows: using syntactic annotation to enhance authorship attribution</title>
<author>
<name sortKey="Baayen, H" sort="Baayen, H" uniqKey="Baayen H" first="H" last="Baayen">H. Baayen</name>
</author>
<author>
<name sortKey="Van Halteren, H" sort="Van Halteren, H" uniqKey="Van Halteren H" first="H" last="Van Halteren">H. Van Halteren</name>
</author>
<author>
<name sortKey="Tweedie, F" sort="Tweedie, F" uniqKey="Tweedie F" first="F" last="Tweedie">F. Tweedie</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:5F5DCCD19520C10BAEF5530E87D9BDF498D3E4AE</idno>
<date when="1996" year="1996">1996</date>
<idno type="doi">10.1093/llc/11.3.121</idno>
<idno type="url">https://api.istex.fr/document/5F5DCCD19520C10BAEF5530E87D9BDF498D3E4AE/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">002796</idno>
<idno type="wicri:Area/Istex/Curation">002607</idno>
<idno type="wicri:Area/Istex/Checkpoint">001B77</idno>
<idno type="wicri:doubleKey">0268-1145:1996:Baayen H:outside:the:cave</idno>
<idno type="wicri:Area/Main/Merge">002910</idno>
<idno type="wicri:Area/Main/Curation">002766</idno>
<idno type="wicri:Area/Main/Exploration">002766</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Outside the cave of shadows: using syntactic annotation to enhance authorship attribution</title>
<author>
<name sortKey="Baayen, H" sort="Baayen, H" uniqKey="Baayen H" first="H" last="Baayen">H. Baayen</name>
<affiliation wicri:level="3">
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Corresponding author at: Max Planck Institute for Psycholinguistics, Wundtlaan 1, 6525XD, Nijmegen</wicri:regionArea>
<placeName>
<settlement type="city">Nimègue</settlement>
<region type="province" nuts="2">Gueldre</region>
</placeName>
</affiliation>
<affiliation>
<wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Van Halteren, H" sort="Van Halteren, H" uniqKey="Van Halteren H" first="H" last="Van Halteren">H. Van Halteren</name>
<affiliation wicri:level="3">
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Catholic University of Nijmegen, Nijmegen</wicri:regionArea>
<placeName>
<settlement type="city">Nimègue</settlement>
<region type="province" nuts="2">Gueldre</region>
</placeName>
</affiliation>
<affiliation>
<wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
<author>
<name sortKey="Tweedie, F" sort="Tweedie, F" uniqKey="Tweedie F" first="F" last="Tweedie">F. Tweedie</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>The University of the West of England, Bristol</wicri:regionArea>
<wicri:noRegion>Bristol</wicri:noRegion>
</affiliation>
<affiliation>
<wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Literary and Linguistic Computing</title>
<title level="j" type="abbrev">Lit Linguist Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="1996-09">1996-09</date>
<biblScope unit="volume">11</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="121">121</biblScope>
<biblScope unit="page" to="132">132</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">5F5DCCD19520C10BAEF5530E87D9BDF498D3E4AE</idno>
<idno type="DOI">10.1093/llc/11.3.121</idno>
<idno type="local">2</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">This paper reports an experiment in authorship attribution in which statistical measures and methods that have been widely applied to words and their frequencies of use are applied to rewrite rules as they appear in a syntactically annotated corpus. The outcome of this experiment suggests that the frequencies with which syntactic rewrite rules are put to use provide a better clue to authorship than word usage. Complementary methods focusing on the high-frequency head and the low-frequency tail of the distribution independently reveal a higher resolution than traditional word-based analyses, and promise enhanced accuracy for authorship attribution.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Pays-Bas</li>
<li>Royaume-Uni</li>
</country>
<region>
<li>Gueldre</li>
</region>
<settlement>
<li>Nimègue</li>
</settlement>
</list>
<tree>
<country name="Pays-Bas">
<region name="Gueldre">
<name sortKey="Baayen, H" sort="Baayen, H" uniqKey="Baayen H" first="H" last="Baayen">H. Baayen</name>
</region>
<name sortKey="Van Halteren, H" sort="Van Halteren, H" uniqKey="Van Halteren H" first="H" last="Van Halteren">H. Van Halteren</name>
</country>
<country name="Royaume-Uni">
<noRegion>
<name sortKey="Tweedie, F" sort="Tweedie, F" uniqKey="Tweedie F" first="F" last="Tweedie">F. Tweedie</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002766 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002766 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:5F5DCCD19520C10BAEF5530E87D9BDF498D3E4AE
   |texte=   Outside the cave of shadows: using syntactic annotation to enhance authorship attribution
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024